1/1 


A0-A1S8  891  TESTS  FOR  PATTERNED  ALTERNATIVESOJ)  PENNSYLVANIA  STATE 
UNIV  UNIVERSITY  PARK  DEPT  OF  STATISTICS 
T  P  HETTHANSPERGER  ET  AL.  AUG  85  TR-56  N88814-88-C-8741 

F/G  6/4 


UNCLASSIFIED 


NL 


85  S  03 


DEPARTMENT  OF  STATISTICS 


The  Pennsylvania  State  University 
University  Park,  PA  16802  U.S.A. 


TECHNICAL  REPORTS  AND  PREPRINTS 

Number  56:  August  1985 

TESTS  FOR  PATTERNED  ALTERNATIVES 

Thomas  P.  Hettmansperger* 
Professor  of  Statistics 
The  Pennsylvania  State  University 

and 

Robert  M.  Norton 

Associate  Professor  of  Mathematics 
The  College  of  Charleston 

and 

Visiting  Associate  Professor  of  Statistics 
The  Pennsylvania  State  University 


s 


DTIC 

electe 

SEP  5  1985 


*The  work  of  the  author  was  partially  supported  by  ONR  Contract 
N00014-80-C0741. 


"DISTRIBUTION  STATEMENT  A 
Approved  to*  public  raleoMl 


Distribution  Unlimited 


3 


/ 


Abstract 

— y  This  paper  treats  the  problem  of  testing  for  a  patterned 
alternative  in  a  one-  or  two-way  layout.  Ordered  and  umbrella 
alternatives  are  special  cases.  Statistics  based  on  combined 
rankings  and  on  pairwise  rankings  are  developed.  Modifications 
necessary  to  incorporate  covariates  are  also  included.  ^ 
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1 .  Introduction 


In  this  paper  we  consider  the  problem  of  testing  for  a 
patterned  alternative.  In  two  sample  comparisons  It  Is  well 
known  that,  when  appropriate,  a  one  sided  critical  region  Is 
more  powerful  than  a  two  sided  region.  The  situation  with  more 
than  two  samples  Is  more  complex  since  there  are  various  ways  to 
specify  an  alternative.  These  alternatives  are  the  patterns  we 
wish  to  detect. 

The  ordered  alternative  In  a  one-way  layout  Is  a 
generalization  of  the  one  sided  alternative.  There  Is  an 
extensive  literature  In  this  area  and  It  continues  to  be  an 
active  area  of  research.  See  Barlow  et.  al.  (1972)  for  an 
overview.  A  monotonic  ordering  of  locations  Is  only  one  of  many 
patterns  that  may  be  of  Interest  to  the  researcher.  Umbrella 
alternatives,  studied  by  Hack  and  Wolfe  (1981),  provide  another 
Important  example. 

In  Section  2  we  consider  testing  for  patterned  alternatives 
In  a  one-way  layout.  Given  a  pattern,  we  show  how  to  construct 
a  most  efficient  rank  test  within  a  certain  class  of  rank 
tests.  We  provide  two  versions  of  these  test  statistics:  one 
based  on  average  ranks  after  ranking  the  combined  data  and  one 
based  on  pairwise  rank  statistics  formed  by  ranking  only  within 
each  of  the  pairs  of  samples.  The  statistics  based  on  the 
average  ranks  have  the  advantage  of 


i 


being  quite  easy  to  apply  In  a  large  variety  of  situations. 

Some  researchers  may  prefer  the  statistic  based  on  pairwise 
comparisons  because  the  pairwise  statistics  provide  direct 
Information  on  the  sources  of  statistical  significance. 

We  also  discuss  the  problem  of  detecting  when  a  sample 
pattern  conforms  to  a  pattern  within  a  finite  set  of  possible 
alternative  patterns.  In  Section  4  we  Illustrate  this  approach 
on  the  umbrella  alternative  when  the  peak  Is  unknown.  Tables  of 
critical  values  are  provided  for  this  problem. 

In  Section  3  we  consider  the  problem  of  testing  for  a 
pattern  when  there  are  covariate  measurements  available. 

2.  The  Test  Statistics 

Suppose  Is  distributed  with  continuous  cumulative 
distribution  function  F(x-0j)  for  j*1,...,k  and  1*1 ,  ....n^, 
and  the  observations  are  mutually  Independent.  We  wish  to 
consider  tests  of  HQ:  0.j*...*0k  versus  H^:  ©j  ■  ©q  *  ®cj’ 

0  >  0,  j*l,...,k,  where  c.j,...,  ck  Is  a  given  set  of  constants. 
The  set  of  constants  c.j,...,ck  specifies  the  pattern  to  be 
detected.  For  example,  c1  <  ...  <  c^  determines  an  ordered 
alternative  and  c.j<...<  c^>  c^+1  >  ...  >  c^  determines  an 
umbrella  alternative  with  peak  at  0^.  The  researcher  will 
generally  specify  the  constants;  and,  unless  there  are 
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Indlcatloos  to  the  contrary,  we  recommend  equally  spaced 
constants. 

Let  Rjj  be  the  rank  of  In  the  combined  data  and  let 
Rj  -  nj1  E.jR^  be  the  average  rank  of  the  jth  group.  We  will 
consider  statistics  of  the  form  V  *=  EajRj  where  Ea^  «  0.  For 
simplicity  throughout  this  paper,  we  will  take  N  *  En^.  Also, 
for  N  fixed  we  let  «  n^/H,  J  while  the  asymptotic 

results  hold  for  n^/N  -♦  XjC(0,l)  as  H 

The  following  two  theorems  follow  at  once  from  the  results 
In  Hettmansperger  (1984,  Chapter  4). 


Theorem  1 .  If  HQ:  0^...-  ©k  Is  true,  then  EV-0  and  Var  V 
«  (N(N+1  )/12]  E  aj2/nj,  where  M  =  En^.  Further,  as  N  -»  •, 


V* 


where  Z  ~  n(0,l)  means  Z  has  a  standard  normal  distribution. 


Theorem  2.  If  H/w:  0j  *  ®0  *  0CjN-1/2,  0  >  0,  j«l,...,k 
Is  true  and  F  has  a  density  f  with  Xf2(x)dx  <  ®,  then 

0 

V*  -*  Z  -  n(0e,l ) 


as  N  -*  ®,  where 


e 


Eaj(crcw) 

(Eaj2/\j)1/2 


(12)1/2  /f 2(x)dx 


Hence,  If  we  reject  Hq:  0-1  =  ...=©^  when  V*  >  la  where 
1  -  0(Za)  =  a  and  $(  )  Is  the  standard  normal  distribution 
function,  then  for  large  N,  the  test  based  on  V*  is  approximately 
size  a  and  has  approximate  local  power  1  -  $(Za  -  0e).  The 
quantity  e  Is  called  the  Pitman  efficacy  of  the  test.  If  two 
tests  have  efficacies  e1  and  e2,  respectively,  then  the  efficiency 
of  V.j*  relative  to  V?*  Is  e^/e*.  A  test  largest  e  has 
maximum  asymptotic  local  power  and  Is  most  efficient.  The  best 
test  of  the  form  V*  Is  given  In  the  next  theorem. 

Theorem  3.  Given  the  pattern  c1 . c^,  e  Is  maximized  by 

aj  =  \)^Cj  ”  V  where  *-j  *  nj^N*  . and  cw  *  EXjCj. 

Lagrange  multipliers  may  be  used  to  derive  the  above  expression 
for  a^  as  the  unique  solution  candidate,  and  the  Cauchy-Schwarz 
Inequality  used  to  confirm  that  e  Is  maximized.  For  a  proof,  see 
the  appendix. 

Henceforth,  we  take 

v  -  EVjtCj  -  cw)Rj  (2.1) 

where  *  nj/N,  N  =  ^nj  and  *  EXjCj.  Furthermore,  from 
Theorem  1,  If  HQ:  0^...*©^  Is  true,  EV=0,  Var  V 
-  [(N+l )/l 2]  E\j  ( c j  -  cw)2  and 
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Z  -  N(0,1 ) .  (2. 


^(Cj-CwlRj  0 


<^j(Cj-Cw)2)'/2 
Hogg  (1965)  Illustrates  the  substantial  Increase  In  power  that 
results  when  this  type  of  test  Is  used  rather  than  an  omnibus 
test.  Hogg  considers  the  test  based  on  sample  means  rather  than 
average  ranks,  but  the  effect  Is  similar. 

It  will  be  useful  to  Introduce  a  slightly  different  form 
for  V*.  Let 


Then 


V*  * 


V/z  (CJ  -  CW> 

[£  \1(C,  -  cw)2]l/2  ’ 
1 


V2  E  bj  Xjl/2  Rj 


(2. 


(2. 


and  E  \j1/2  bj  =  0,  E  bj2  «  1. 

Recall,  the  researcher  chooses  a  pattern  c^,...,ck,  or 

equivalently,  ^.....b^.  If,  on  the  other  hand,  no  pattern 

or  only  a  vague  pattern  Is  specified  then  we  could  use  Max  V* 

where  we  maximize  over  a  collection  of  choices  of  b^,...,bk 

subject  to  the  appropriate  constraints. 

For  example.  In  testing  H^:  ©1  =...=  0^  versus 

H^:  e^s  not  all  equal,  we  would  use  Max  V*  where  the 

1  /2 

maximum  Is  taken  over  all  b.j,...,bk  such  that  E\j  bj  =  0 
2 

and  Eb.  =  1.  The  Cauchy-Schwarz  Inequality  Implies  that 


[EVj(Rj  -  <N+1)/2)2]l/2 
The  constraints  are  satisfied  and 

Max  V*  «  [(12/(n+l))  E\j(Rj  -  (N+l )/2)2]1/2  (2.6) 

«  Hl/2 

where  H  Is  the  Kruskal-Wallls  (1952)  rank  statistic  for  the 

one-way  layout.  Hence,  under  Hq,  Max  V*  Is  distributed  like 

2 

the  square  root  of  x  (k-1)  and  can  be  used  to  make  the  test. 

Next,  consider  HQ:  0.j*...«0k  versus  H^: 

0.j<. ..<  0^  with  at  least  one  strict  Inequality.  We 
propose  requiring  c^ . . . . .c^  to  be  equally  spaced  unless 
there  are  Indications  to  the  contrary.  Hence,  take  Cj=j, 
j-1,...,k  and  the  statistic  (2.1)  becomes 

V  -  EXjU  -  Jw)Rj  (2.7) 

where  jw  =  EjXj.  This  has  a  convenient  asymptotic 
distribution  theory  as  outlined  above.  When  the  sample  sizes 
are  equal,  n^*...*^,  we  have  V  =  k-1  E  (j  -  (k+l)/2)Rj 
and  Var  V  *  (k+l)(k-l)(N+l)/144  which  is  quite  easy  to  use. 

If  no  spacing  can  be  specified,  then  we  would  use  Max  V* 

where  the  maximum  is  taken  over  choices  of  ^.....b^  such 

1/2  2  -l  /2  -1  /2 

that  EVj  bj*0,  Eb^  =  1  and  V,  b1  <...<  bk. 

Using  an  argument  similar  to  that  of  Hogg  (1965,  p. 11 56)  we  have 
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Max  V*  =  [(12/(N+1 ) )  £Vj[Rj  -  (N+l )/2]  ] 


20/2 


(2.8) 


=  [X2  11/2 

lxrankJ 


where  R1  <.  ..<  R12  is  the  Isotonic  regression  of 

Rlf...,Rk  with  weights  Vj,...,^  and  x2ank  is  Chacko's 

(1963)  rank  statistic  for  ordered  alternatives.  See  Barlow 

et.al.  1972,  p.  201)  for  a  discussion.  Special  tables  are 

_2 

required  for  the  application  of  xpank  and  hence  when 
reasonable,  we  recommend  using  (2.7). 

Finally,  we  consider  the  umbrella  alternative  for  which 
we  have  Hq:  0^...=©^  versus  H^:  01  <.  ..<  >  ©t+1 

>. ..>  ek.  We  treat  the  case  of  the  peak  specified  first, 
i.e.  the  case  when  t  is  known.  Again,  unless  otherwise 
Indicated,  we  would  take  equally  spaced  c.j,...,ck,  say 

Cj  »  j,  j=l .  t  and  Cj  *  2t-j  for  j=t+l,...,k,  and  use 

V*  (2.2)  to  carry  out  the  test. 

If  the  peak  Is  unknown,  we  Impose  the  restriction  that 

c1 . ck  be  equally  spaced.  Then  there  are  k  umbrella 

alternatives,  including  the  two  monotone  alternatives  at  either 
extreme.  We  propose  to  use  Max  V*  where  the  maximum  is  taken 
over  a  finite  collection  of  possible  patterns  of  c^,...,ck. 

The  Index  at  which  the  maximum  occurs  estimates  the  peak. 

This  is  Illustrated  in  section  4. 

In  general,  there  may  be  p  possible  sets  of  coefficients 
c-|,...,ck.  We  now  describe  the  distribution  theory  needed  to 


Implement  max  V*.  Let  V*'  =  (Vi*. . . . ,Vp*)  be  a  vector  of 

mm 

statistics  with 

£*-j(ctj  ~  ctw)Rj 
1/2  J 

-  (2.9) 

tE\j(ctj  -  ctw>2]1/2 

J 

for  t*l,...,p.  A  standard  calculation  shows  that  under  Hg, 

E\j(csj  -  csw)(ctj  -  ctw) 

cov(vs*,vt*)  =  - - - - - — (2.10) 

[Z\j(csj  -  csw)2  E\j(c^j  -  Ctw)2]1/2 
J  j 

In  terms  of  btj,  t*l,...,p,  j=l,...,k,  defined  In  (2.3)  we 
have  the  covariance  matrix  of  V*  given  by 

mm 

cov(V*)  *  BB'  (2.11) 


Vt 


12 

N+1 


where  B  =  (btj).  The  asympotlc  distribution  is  given  by  the 
next  theorem. 

Theorem  4.  Suppose  Hg:  0-j*..."©^  is  true.  If  N  ->  »  In  such 

D 

a  way  that  nj/N  -*\j  c  (0,1),  j=l,...,k,  then  V*  -»  BZ,  where 

mm  mm 

Z  ~  MVN(0, I) . 

mm  mm  mm 

Standard  arguments,  given  for  example  in  Hettmansperger 
(1984,  Chapter  4),  show  that  V*  has  a  limiting  MVN(0,  BB*) 
distribution,  and  BZ,  where  Z  ~  HVN(0,I),  has  the  same 
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distribution. 

It  further  follows  that,  under  HQ, 

D 

max  V*  -»  max  BZ.  (2.12) 

The  distribution  of  max  BZ  is  Intractable  and  can  be  computed 
or  approximated  only  in  a  few  very  special  cases.  See,  for 
example,  Johnson  and  Kotz  (1970).  However,  it  is  quite  simple 
to  simulate  probabilities  of  Max  BZ.  If  the  observed  value  of 

M 

V*  is  v*,  then  the  appropriate  P-value  of  a  test  based  on 
max  V*  is 

P(max  BZ  >  max  v*) .  (2.13) 

MM  M 

This  is  easy  to  simulate  since  all  we  need  is  a  vector  Z  of  iid 

M 

n(0,l)  variates  along  with  B.  We  Illustrate  this  in  section  4 

M 

for  umbrella  alternatives  with  unspecified  peak.  We  also 
provide  a  table  of  selected  simulated  critical  values  for  this 
problem  (Table  4.1  and  4.2).  Finally,  note  that  if  we  store 

r'*(Ri  ....  ,"R|<)  and  the  matrix  A  =  dlag  (V|1/2 . X|<1/2), 

then  we  can  compute  V*  =  (12/(N+1  ))^2  BAR. 

Terpstra  (1952)  and  Jonckheere  (1954)  proposed  a  test  for 
ordered  alternatives  based  on  pairwise  Mann-Whitney-Wllcoxon 
statistics.  The  statistic  Is  J*  EE  U-jj  where  U^j  is  the 

number  of  times  observations  In  the  jth  sample  exceed 
observations  in  ith  sample.  Mack  and  Wolfe  (1981)  propose 
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At  =  EE  U-t 4  +  EE  U<i  for  testing  an  umbrella  alternative 
1<j<t  t<Kj 

with  peak  at  t.  Work  of  Fairley  and  Pearl  (1984)  suggests 
that  neither  the  combined  ranking  nor  pairwise  ranking  method 
Is  superior.  Hence,  for  completeness,  we  provide  statistics 

ft 

which  are  equivalent  In  a  Pitman  efficiency  sense  to  V  but 
based  on  pairwise  rankings.  This  affords  an  easier  comparison 
of  our  statistics  with  those  of  Hack  and  Wolfe  for  umbrella 
alternatives. 

The  following  lemma  provides  the  link. 

Lemma.  Given  such  that  Ed^  *  0  and  any  quantities 

Z-| , . . .  ,Zk,  we  have 

k  k-1  k 

k  E  djZj  =  E  E  <dj  -  di)(Zj  -  Zi). 
j=l  1-1  j=1+l 

The  proof  Is  by  direct  algebrlac  manipulation. 

* 

To  convert  V  Into  a  form  that  Is  In  terms  of  pairwise 
comparisons,  we  would  let  Z^  =  and  dj  =  (Cj  -  cfw). 

Then  to  move  to  pairwise  rankings  we  replace  Rj  -  R^  by 
W1j  81  ni1  "j1  U1j*  Hence*  the  statistic  which  we  consider 
Is 

U*  =  k  EE(dj  -  d-|)W^j  (2.14) 

with  d,  =  \,(c,  -  c  ).  Under  the  null  hypothesis 
j  j  J  w 
* 

Hg:  EU  =0  and 


(di-di)2 

Var  U"  =  —  {EE  — - 

12  Kj  n-jnj 


dj  di 

+  k  EE  —  —  (dj  -  di)} 
Kj  nj  n1 


(2.15) 


See  the  appendix  for  a  proof.  Furthermore,  under  the  null 
*  -1  /2  * 

hypothesis,  (Var  U  )  U  has  a  limiting  standard  normal 

distribution.  Because  of  the  complexity  of  the  variance 
* 

formula,  U  Is  of  doubtful  practical  value.  However,  If 

we  have  equal  sample  sizes,  the  formulas  can  be  simplified 
* 

considerably  and  U  can  be  used. 

Suppose  n.j  =  ...=nk  =  n,  so  \j=k-1  for  j*l . k. 

*  -l /?  * 

Then,  under  the  null  hypothesis,  (Var  U  )  U  Is 
asymptotically  n(0,l)  with 


u*  .  L  EE  (Cj  -  Cl)  u,j 
n*  Kj 

and,  since  EE  ( c -i  — c i ) 2  =  k  E(c^-^)2, 

Kj  j 

k 

Var  U*  =  — ! —  (k+1)  E  (cj-c)2  < 

12nk  n  j»l 

k 

«  (nk+1)  £  (C1-£)2. 

12n2k  j=l 

Calculations,  similar  to  those  of  Theorem  2,  show 
that  U*  and  V*  have  the  same  efficacy  and  same  asymptotic 
local  power.  We  can  now  Illustrate  the  difference  between 
our  approach  of  selecting  c-j,...,^  based  on  equal  spaclngs 


(2.16) 


(2.17) 


m  *  •  “  ■  "  •  r  «  “  •  '**•*»  1  «  ‘  •*  ,  *  ■  •  »  '  •  '  •  *  ,  *  •  "  •  “  *  "  »  *  *  •  •  *  «  ?  ^  •  .  t  • 


and  the  Hack-Wolf e  approach  of  piecing  together  two 
Jonckheere-Terpstra  statistics  for  testing  the  umbrella 
alternative.  For  example,  take  k=5  and  t=3.  Then,  we  have 
n2U*  =  U-J2  +  2U-J3  +  U23  +  U14  -  U34  -  2  U35  -  U45  -  U25 
and  the  Mack-Wolfe  statistic  Is  equivalent  to 

A*  *  U-J2  +  Ul3  +  u23  ■  u34  ~  u35  ”  u45* 

The  statistic  U*  utilizes  U14  and  U25,  which  make  the 

* 

comparisons  across  the  peak.  These  are  excluded  from  A  and 
can  result  In  some  loss  of  efficiency.  The  exclusion  could 
have  unpleasant  consequences  In  the  peak  unknown  case,  as 
discussed  In  a  prototype  example  In  section  4. 

We  feel  that  the  statistic  V*  (or  U*  when  practical) 
provides  a  natural  and  easy  way  to  Implement  a  test  for  a 
patterned  alternative.  If  the  researcher  does  not  feel 
equally  spaced  alternatives  are  appropriate  and  can  provide 
rough  Information  on  the  spaclngs,  then  it  Is  a  simple  matter 
to  choose  the  appropriate  weights  c-j,..,ck. 

The  approach  described  above  extends  quite  easily  to 
the  additive  two-way  layout  with  b  blocks  and  k  treatments. 
Suppose  1=1,..., b,  j=l,...,k  and  t=l,...,n^j,  are 

mutually  Independent  random  variables  with  respective 
continuous  distribution  functions  F(x-0^).  Further, 
suppose  0^j  =  11  +  ♦  0Cj,  0  >  0,  and  c1 . c^  specify 
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the  pattern.  The  additivity  implies  that  the  pattern  is 

propogated  through  the  b  blocks.  We  wish  to  test  Hq:  0=0 

versus  H.:  0  >  0. 

A 

The  data  should  be  ranked  within  blocks.  Then  R^ 

Is  the  rank  of  X^t  when  ranked  among  the 

k  _ 

N-j  *  E  n-j j  observations  in  the  1th  block.  Let  denote 

the  average  rank  In  the  (1,j)  cell.  Results  similar  to 
those  In  Theorems  1-3  show  that  we  should  use 


b  k 

V  -  E  E  Xi^Cj-cwJRij  (2-18) 

1-1  j-1 


k 

where  Vfj  =  n1j/N1  an<l  c^w  -  E  VjjCj.  Under  the  null 

J*1 

hypothesis  Hq:  0=0,  EV=0  and 

b  k 

Var  V  =  J_  E  (Nj+1)  E  Mi(ci-Ciw)2.  (2.19) 

12  1=1  j=l 

-1  /2 

Further,  (Var  V)  V  has  asymptotically  a  standard  normal 
distribution  as  n^  -*  «,  1=1,..., b,  j=l,...,k,  and  b  remains 
fixed  or  as  b  -*  »  and  n^  remains  fixed.  Hence,  It  Is  easy  to 
construct  approximate  critical  values  for  the  test  based  on  V. 
When  the  sample  sizes  are  equal,  n^=n  for  all  1  and 


j,  and  the  formulas  can  be  simplified. 


In  this  case,  Xij=l/k,  N^-nk,  c^k-1  Ecj-c,  and  so  we  can 


consider: 


b  k 

v'  -  E  E  (Ci-c)Rii 
1-1  j-1 


(2 


1  £  (Cj-c)Rj 
j-1 


where  R^  Is  the  sura  of  ranks  In  the  (1,j)  cell  and  R^  Is  the 


sum  of  ranks  under  treatment  j.  Further, 
,  bn2k(nk+l)  k 


Var  V 


12 


£  (ci-c)2 

j-1  J 


(2. 


For  example.  If  n=l  and  Cj-j,  so  we  are  considering  the 


additive  two-way  layout  with  one  observation  per  cell  and 


ordered  alternatives,  then  V  Is  Page's  (1963)  statistic  and 
2,1.2 


Var  V  =  bk  (k  -l)(k+l)/144.  The  statistic  V,  (2.18), 
generalizes  and  Improves  the  statistics  In  Hettmansperger 
(1975). 


3.  Covariate  Adjusted  Tests  In  a  One-Way  Layout. 

It  may  happen  that  the  researcher  has  a  set  of  covariate 
measurements  available  along  with  the  treatment  measurements. 
It  then  becomes  necessary  to  correct  or  adjust  the  rank  test 
statistic  In  order  to  test  for  a  patterned  alternative.  We 
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will  assume  a  parallelism  model  for  the  covariate  so  the 
same  slope  parameter  applies  to  every  treatment;  otherwise, 
there  Is  an  Interaction  between  the  covariate  and  treatment 
factors,  and  a  test  for  treatment  differences  Is  difficult 
or  Impossible  to  Interpret. 

It  Is  convenient  to  cast  the  one-way  layout  In  terms 
of  a  linear  model.  The  observations  X^,  j«l,...,k, 

1*1,..., nj,  will  be  arranged  In  a  vector  Y  of  length  N-Enj. 

mm 

The  first  n^  components  comprise  the  first  sample  and  so  on. 
Then,  If  there  are  q  covariates,  the  model  Is  written  as: 

Y  -  la  ♦  W0  ♦  Zy  ♦  e  , 

0m  mm  mm  M 

where  1  Is  an  N  x  1  vector  of  ones,  W  Is  an  N  x  k  matrix  In 
which  the  1th  column  has  ones  corresponding  to  the  1th 
sample  and  zeros  otherwise,  Z  Is  an  N  x  q  matrix  of  covariates, 
a  Is  the  scalar  Intercept  parameter,  0'  «  (©it..., 0k)  are  the 

treatment  parameters  of  Interest  and  y'  *  (yi . Yq)  gives 

the  slope  parameters.  The  mean  centered  matrices  will  be 
denoted  by  Wc  and  Zc.  For  example,  Wc  *  W  -  N—1 11 ' w. 

mm  mm 

The  matrix  WcWc  Is  singular  and  we  will  assume  that  the 
matrix  Z^  Zc  Is  nonsingular.  Finally,  the  vector  e'  * 

mm 

(e-j . e^)  of  random  errors  are  assumed  to  have  Independent 

and  Identically  distributed  random  variable  components  from 
a  continuous  distribution  with  unique  median  0. 
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As  before,  we  are  Interested  In  testing  Hq  : 

=  . .  .*=0^  versus  H^:  0j  =  0Q  +■  Gc^,  j=l . k, 

0  >  0  and  c1 . ck  a  given  pattern.  The  test  will  be 

based  on  average  ranks,  after  the  data  have  been  adjusted 
for  the  covariates.  Suppose  y  Is  an  estimate  of  y  based 

M 

on  the  reduced  model  under  Hq,  Y  =  la  +  Zy  +  e.  We 

then  rank  Y  -  Zy  and  use  R-| , . . . , ,  the  average  ranks  of 
0*0  0*0 

the  k  adjusted  samples.  In  equation  (2.1)  to  form  a  test 
statistic.  The  following  theorem,  which  Is  proven  In  the 
Appendix,  provides  the  asymptotic  result  needed  to  approximate 
critical  values. 


Theorem  5.  Suppose  Hq:  0-| * . . . “0|c  Is  true.  If  N1/2(y_Y) 

*0 

Is  bounded  In  probability  and  If  N-1  l'c  Zc  has  a  positive 
definite  limit  as  N  -»  «,  then 

(\l/2  k  D 

31  1  E  \j(crew)Rj  -»  Z  -  n(0,l )  (3.1) 

N+lJ  o  j»l 

where 


■=  E  \<(c-j-cw)2  -  N-^ 
j-1 


c  wczc 


(ZCZC) 


-1 


ZCWCC 


(3.2) 


C  *  (M”Cw,  •  '  •  ,C|("Cw)  • 
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This  result  corresponds  to  (2.2)  and  provides  the  covariate 
adjustment  for  the  standardization  of  the  test  statistic. 
Hence,  we  cannot  simply  adjust  the  data  and  apply  (2.2);  the 
variance  must  be  adjusted  also. 

Note  that  the  variance  adjustment,  given  as  the  second 
term  In  (3.2),  is  non-negative.  This  follows  since 

•  I 

Z  (Z  Z  )  Z  Is  a  non-negative  definite  projection  matrix. 

t  v  v 

When  there  Is  only  one  covariate,  the  correction  term  In 

(3.2)  Is  [^(Cj-cjZ^]2  /  [N'1E(Z1-Z)2]  where 

Zcj  Is  the  average  of  the  nj  components  of  Zc,  corresponding 

mm 

to  the  jth  sample.  This  means  that  If  (2.2)  rather  than 
(3.1)  Is  applied  to  the  adjusted  data,  an  asymptotically 
conservative  test  will  result.  It  Is  easy  to  construct 
examples  for  which  the  correction  term  Is  substantial. 

The  result  In  Theorem  5  Is  quite  general  and  can  be 
applied  to  an  additive  AOV  model  In  which  the  other  factors 
are  Identified  as  the  covariates.  When  there  are  an  equal 
number  of  observations  per  cell,  balanced  data,  the  correction 
term  in  (3.2)  Is  0,  since  W^Zc  «  0.  Then  the  simpler  result 
in  (2.2)  can  be  applied  directly  to  the  average  ranks  of  the 
adjusted  data. 
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4.  Examples  and  Olscusslon 

As  an  example,  we  consider  data  from  an  age  discrimination 
suit  filed  In  a  United  States  District  Court  In  South  Carolina 
In  1980.  Because  this  case  was  settled  out  of  court,  the  names 
of  Individuals  and  the  company  Involved  will  not  be  mentioned. 

We  say  only  that  related  to  a  company  reduction  In  force,  the 
company  terminated  a  number  of  employees  based  on  evaluation 
of  job  performance,  that  those  terminated  were  near  retirement 
age,  and  that  those  Individuals  brought  suit. 

In  one  aspect  of  the  case,  twelve  Individuals  with  the 
same  job  titles  were  ranked  from  one  to  twelve  as  to  their  job 
performance,  with  "one“  representing  the  least  proficient  and 
■twelve1'  the  most  proficient.  Ho  measure  of  job  proficiency 
other  than  the  relative  rankings  was  provided  by  the  company. 
Month  and  year  of  birth  data  are  given  In  Table  4.1.  The 
company  claimed  that  age  would  have  no  statistical  bearing  on 
job  performance. 

Table  4.1  may  be  viewed  as  giving  rank  summaries  of 
observations  of  a  continuous  random  variable,  performance, 
with  twelve  Independent  samples  of  size  one,  the  samples  being 
indexed  by  birthdate.  As  the  alternative  hypothesis,  we  take 
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ha:  ©1  <  e2  <. ..<  et  >  et+1  >  et+2  >. ..>  e12,  with  at 
least  one  strict  inequality  and  t  unknown.  The  unknown 


peak  perspective  reflects  one  item  that  was  of  interest  in 


the  case.  Namely,  whether  proficiency  on  this  job  tends  to 
increase  with  age  up  to  some  point,  then  tends  to  decrease 
with  age. 


Month  and  Year 
_ of  Birth _ 

Rank  of 
Birth  Date 

Performance 

Rank 

11 

-  1915 

12 

1 

2 

-  1917 

11 

3 

12 

-  1918 

10 

4 

4 

-  1919 

9 

7 

9 

-  1921 

8 

11 

11 

-  1921 

7 

6 

12 

-  1926 

6 

10 

2 

-  1927 

5 

12 

6 

-  1929 

4 

9 

9 

1930 

3 

8 

9 

1931 

2 

5 

10 

1931 

1 

2 

Table  4.1 

We  use  an  analysis  for  general  k  and  equal  sample 
sizes.  The  test  statistic  is  to  be  sensitive  to  any  of  the 
following  equally  spaced  umbrella  patterns  c^,...,^, 
where  h  >  0: 

eo,0o-h,0o-2h,...,0o-(k-l)h  peak  at  c] 

0o,0o+h,0o,0o-h . 0Q-(k-3)h  peak  at  c2 

©o,0o+h,0o+2h,0o+h,...,0o-(k-5)h  peak  at  c3  (4.1) 


®n»®n+h,6n+2h, . .  •  ,0/»+(k-l  )h 


peak  at  c. 
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4r  ft  it 

We  use  as  the  test  statistic  =  roax{  V1 , . . . , Vk } , 
* 

where  Vt  is  given  in  (2.9).  When  .  .=\k=l/k,  then 

ctw=ct  and  we  have 


Vt 


12 


11/2 


ctj~^t 


k(N+lJ  j  [E(ctr-ct)2]1/2 


Rj 


(4.2) 


u  j  r 

Since  for  each  t  the  coefficients  of  the  Rj's  are  centered 

and  normed,  these  coefficients  *do  not  depend  on  ©Q  or  h. 

Hence,  for  t=l,2,...,k,  we  may  take  ctj=j  for  j=l,2,...,t 

and  ctj=2t-j  for  j=t+l,...,k.  Table  4.2  provides  .05  and 

* 

.01  critical  values  for  V _  when  n.=n_=. . .=n  =1 . 

max  I  z  k 


HI 

5 

6 

7 

8 

9 

10 

11 

12 

15 

20 

25 

30 

.05 

1.94 

2.02 

2.10 

2.14 

2.17 

2.19* 

2.22 

2.21 

2.26 

2.27 

2.29 

2.30 

.01 

- 

2.19 

2.31 

2.40 

2.48 

2.53* 

2.59 

2.62 

2.69 

2.75 

2.81 

2.82 

Table  4.2  Critical  values  of  the  null  distribution  of 
when  ni=n2=. .  .^n^l .  Entries  through  k*=9  are 
exact,  while  entries  thereafter  are  based  on  simulations. 


When  the  common  sample  size  is  one  and  Hq  Is  true, 

R.j  ,*R2 . .  .'Rjj  Is  a  permutation  of  the  Integers  1,2,...,k,  with 
all  k!  permutations  being  equally  likely.  One  may  generate 
on  a  computer  all  the  permutations  by  taking  as  a  seed 
permutation  12... k,  and  use  k-1  nested  loops  as  follows. 

Let  the  ith  Innermost  loop,  1=1,2 . k-1,  cycle  the  first 


1+1  positions  of  the  current  permutation  to  obtain  the  next 

permutation.  Say,  old  position  1  -*  new  position  2,  old 

position  2  -*  new  position  3 . old  (1+1  )st  position  -» 

new  position  1.  Let  this  loop  act  1+1  times,  each  time 

generating  the  next  permutation.  For  each  new  permutation 
* 

the  value  of  is  computed.  In  this  way  the  exact 
max 

* 

distribution  of  V  „  will  be  obtained.  One  also  may 
max 

obtain  approximate  critical  values  for  large  k  by  using 

random  permutations  of  the  Integers  1,2 . k  and  empirical 

distributions. 

* 

For  example,  when  k«5  the  maximum  value  of  V  .. 

max 

is  exactly  2,  occurring  when  *  12345  or  54321. 

The  next  largest  value  is  1.9414,  which  occurs  for  the 

permutations  12354,  12453,  35421,  and  45321.  Let  W  denote 
* 

the  range  of  V _  after  values  are  truncated  to  two 

max 

* 

decimal  places.  Then  1.94  ■  min  {wcW:  ^g^max  >  w)  <  .05} 

Hence  In  the  table,  1.94  Is  the  .05  critical  value  (6/5!  * 

There  Is  no  .01  critical  value.  Each  simulation  entry  in 

the  table  Is  based  on  at  least  40,000  random  permutations. 

As  an  explanation  to  the  decrease  from  2.22  to  2.21  in  the 

* 

row  of  .05  critical  values,  we  note  that  V  is  discrete 
with  gaps  In  Its  range  and  some  values  of  W  are  much  more 
likely  to  occur  than  their  neighbors.  For  Instance,  when 
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k*U.  2.20  /  W  while  PQ(V^x=2.21  )  =  .003,  PQ(V^X=2.22) 

*  .0007,  based  on  120,000  permutations. 

For  the  job  performance  data,  R.j=2,  R2=5, . . . ,R12=1  and 

v*  “2.8795  >  2.62,  so  that  Hn  is  rejected  at  the  .01 
max  •  u 

significance  level.  A  simulation  approximation  of  the  p-value 
is  .0015. 

Were  the  asymptotic  setting  of  Theorem  4  appropriate, 

*  * 

V  ,  with  V*  as  in  (4.2),  could  be  compared  to  the  entries 
max  v 

in  Table  4.3.  These  are  critical  values  of  the  maximum 
component  of  BZ  ,  where  Z  ~  HVN(0,I)  and  btj  * 

mmmm  mm  mm  mm 

(ctj“Ct)[E(ctr“Ct)2r1/2*  for  j-1,2 . k  and  t=l,2 . k, 

r 

with  ctj-j  for  j«l ,2, . . . ,t  and  ctj*2t-j  for  j»t+l . k. 

Except  for  the  entries  when  k=2,  the  critical  values  in  Table 
4.3  were  obtained  by  simulation  using  empirical  cumulative 
distributions.  When  k=2  the  two  umbrella  patterns  are  c^l, 
c12»0  and  c21«l,  c22=2.  Hence  we  seek  critical  values  for 
max(-2'1/2(ZrZ2),  2*1/2(Z1-Z2))«2“1/2|Z1-Z2I .  where 
Z.j  and  Z2  are  lid  standard  normal.  Hence  the  critical  values 
would  be  identical  to  those  for  ( | . 
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131 

2 

3 

4 

5 

6 

7 

8 

9 

10 

15  20 

30 

.05 

1 .96* 

2.12 

2.19 

2.24 

2.26 

2.27 

2.28 

2.29 

2.30 

2.31  2.31 

2.32 

.01 

2.58* 

2.73 

2.78 

2.83 

2.85 

2.86 

2.87 

2.89 

2.89 

2.91  2.91 

2.91 

Table  4.3  Approximate  (except  for  k-2)  critical  values  for 
the  limiting  null  distributions  of  V^x  when 
V| «\.2* •  •  •  *^ic=l  A  and  v£  is  given  in  (4.2). 

We  now  compare  the  unknown  peak  statistic  Aj  as 

* 

presented  by  Mack  and  Wolfe  (1981)  to  l  on  an  example 

nuxX 

which  shows  that  the  former  may  be  dramatically  affected  by 
minor  perturbations  in  the  data.  We  begin  with  their  statistic 


Ao  =  E 
l<Kj<4 


♦  E  Uji, 

4<1<j<k 


(4.3) 


used  when  the  umbrella  peak  Is  known  to  occur  a  priori  at 


4  k  k 

group  4.  Letting  N-j  =  E  n\,  No  -  E  n*  and  N  =  E  nj, 

1=1  1*4  1*1 


then  the  null  asymptotic  distribution  of  [Aj-uo(Ai)]/<jo(a8.) 
is  standard  normal  with 

k 

Uo(Afc)  =  4-1[Ni2  +  N22  -  £  n^2  -  ng,2] 

1-1 


and 

k 

Oo2(Aa)=72-1[2(Ni3+N23)  +  3(N12+N22)-  E  nj2(2nf+3)  -  ng(2ng+3) 


+  12nlN1N2-12n12N]. 
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ln  the  unknown  peak  case  the  statistic  is 

k 

A  £  *  E  Xt(At-Uo(At))/c»o(At)  (4-4) 

t*l 

where  xt  =  0  if  group  t  is  not  estimated  as  being  a  peak  group, 
and  xt*=l/r,  otherwise,  with  r  being  the  number  of  groups 
estimated  as  peak  groups.  To  estimate  which  are  the  peak 
groups  one  computes 

k 

Bt  *  E  u1t» 

1=1 

1*t 

then  B*=[Bt-vo(Bt)]/0o(Bt),  for  t-1,2 . k,  with 

VBt)=nt(N~nt)/2  and  «o2(Bt>-nt(M-nt)(^1)/12-  Group 

*  *  * 

t  Is  estimated  as  a  peak  group  if  Bt«max{B1 , . . . ,8^} . 

As  seen  in  (4.3)  Mann -Whitney  counts  are  not  computed 
for  groups  on  opposite  sides  of  the  peak.  Rather  than  looking 
at  the  overall  pattern,  a  sample  pattern  is  viewed  in  two 
pieces,  each  piece  contributing  circumstantial  evidence  or 
not  to  one  of  the  two  double  sums  In  (4.3).  The  absence  of 
across  the  peak  comparisons  can  present  a  problem,  as  we  now 
see. 

Consider  four  samples  of  size  10  with  ranks  as  In  Table 
4.4.  Group  two  is  estimated  as  the  peak  group  (82=226,  B4=225), 
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wlth  A2=251,  u0(A2)=200  and  cQ(A2)=35.05.  This  gives 
A£*1.46  and  a  p-value  satisfying  p  ».l,  by  Table  1  of 
Mack  and  Wolfe. 

Samples  1  and  2  conform  perfectly  to  the  leftmost 
monotone  increasing  pattern  of  an  umbrella,  whereas  samples 
2,  3,  and  4  conform  hardly  at  all  to  a  rightmost  monotone 
decreasing  pattern  of  an  umbrella.  Based  on  A£,  the  sum 
of  two  Jonckheere's  statistics,  we  fail  to  reject  HQ  at 
any  reasonable  significance  level. 


Sample 

1 

2 

3 

4 

10 

39 

33 

40 

9 

38 

30 

37 

Ranks 

8 

35 

27 

36 

7 

34 

24 

32 

6 

29 

21 

31 

5 

28 

18 

26 

4 

23 

15 

25 

3 

22 

13 

20 

2 

17 

12 

19 

1 

16 

11 

14 

Table  4.4 

Suppose  that  the  observations  ranked,  say,  37th  and 
38th  were  misranked,  and  should  be  switched.  Now  group  four 
is  estimated  as  the  peak  group  (B3=225,  8^*226)  and  A4=450, 
with  u0(A2)=300  and  «0(A4)*41 .4.  Hence  A£*3.62,  highly 
significant,  with  a  p-value  satisfying  p  «  .01.  When 
Mann-Whitney  counts  from  all  possible  pairs  of  groups  are 
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brought  to  bear,  there  Is  great  support  for  an  overall  monotone 
Increasing  trend  In  group  locations.  Thus  the  failure  to 

compare  groups  across  the  peak  can  be  problematic. 

*  — 

To  use  V  and  the  data  In  Table  4.4  we  need  R,«5.5, 
max  i 

R,«28.1,  R,«20.4,  and  R.-28.0.  This  gives  V*  *3.617.  With 
c  j  max 

* 

the  37  and  38  exchanged,  ^*3.641.  Either  case  Is  highly 
significant,  according  to  Table  4.3. 


APPENOIX 


Proof  of  Theorem  3 


We  find  the  values  of  a-j.ag . a.,  which  maximize 

(Ea^/tEa2^)1'2  (1) 

subject  to  Eaj=0,  where  c-j,...,^  and  are  known. 

2  1/2 

Define  f(a1,...,a)c,X)=(EajCj)/(Eaj/Xj)  +  XEa y 

Then  3f/3a1»A_2{Ac1-(EajCj)a^\^1A~1 }  +  X,  where 

A*(Eaj/\j)  '  .  Set  3f/3a^=0  for  1=1,2 . k  and  use  the 

fact  that  Eaj=0.  First, 

{A^^-tEajCjJa^/A3  +  XX^O  (2) 

for  all  1.  Sum  the  k  equations  In  (2)  to  get  A2(ECjXj)= 

-XA3EXj,  or  \=-(ECj\j)/A.  Substituting  In  (2)  and 

2 

solving  for  a^  yields  a^X^  (c^-EXjCjJ/EajCj. 

When  a*  Is  a  critical  vector  of  expression  (1),  so  Is  any 
scalar  multiple  of  a*.  Hence  we  may  reduce  our  considerations 
to  the  critical  vectors  a*  and  -a*,  where  ai*=Xi(cj-cw)  with 
Cy-EXjcj.  Notice  that  the  vectors  a*  and  -a*, when 
substituted  Into  expression  (1),  yield  results  of  opposite 
sign,  and  these  are  the  only  critical  vectors,  up  to  scalar 
multiples. 

To  see  that  a  =  a*  maximizes  (1),  we  use  the  Cauchy-Schwarz 

M 

Inequality.  Now 

(Ea1c1)/(Ea2/\1)'/2  -  [£(a1\"1/2)\J/2(c1-Ew))/(Ea2/\)),/2 

<  1E\1(c1-cw)2}’/2  . 


and  algebra  shows  that  when  a  -  a*,  expression  (1)  equals 

mm  mm 

the  upper  bound.  Hence  a*  is  the  unique  solution  which 
maximizes  expression  (1). 


Proof  of  (2.151 .  Let  vij“(dj_di)uij/ninj • 

Var(E  E  V<<)  -EE  Var(V-M)  +  Covariance  terms. 

1<j  1<j 

rj-1  Ic 

Covariance  terms  «  E  E  E  Cov(Vij,Vsj)  E  Cov(Vi<,Vu) 

1<j  s-1  t«1+l 

Ls*1  t*j 

1-1  k 

+  E^  Cov(V1:j.Vs1)  +  E  Cov(V^ j ,Vjt) 

s*j  t*1 


E  E  E  (dj-di)(dj-ds) 

1<J  [_s-l  J  1 2n j 


E  (clj-xllXdt-Ol)  -J- 

t-Hl  12ni 


♦  E  (dj-dilldi-dsJfji- 
s-1  12ni 


where  the  covariances  are  given  In  Tryon  and  Hettmansperger 
(1973,  Section  4).  Now 


covariance  terms 


[k  k 

E  (dj-ds)  1  +  E  (dt-dO 
s-1  n<  t-1 

s*1,j  t*1,j 


i*  E  E  (dj-di)  (I  [(k-2)di+dfKlj]  -  1  [(k^Jdi+dt+dj]} 
12  1<J  nj  n^ 


i  r  /dJ 

1  EE  (dj-di)  k  — - 

12  1<j  nj  nj 


-  (dj-di) 


.  I"1  "J/J 


dj  djl 

«  ^  E  E  (d<-di)  - - 1  EE  (dj-di)2  I  ♦  1 

12  1<J  J  \nj  n^j  12  lnj  nj 

2 

Further,  Var  «  (d^-d^)  (n^-Hij+l)/(12n^nj)  and 

(2. IS)  follows  by  combining  the  variance  and  covariance 

terms. 


Proof  of  Theorem  5. 

* 

Let  W  be  the  matrix  formed  by  dropping  the  first 
column  of  W.  Consider  the  model  Y  «  la  W*0*  ♦  Zy  ♦  e 

MM  M  MM 

where  0*'  -  (02, . . .  ,0|<) .  Now  the  design  X  *  [W*,Z]  Is  such 
that  N~i X^XC  has  a  positive  definite  limit. 


sb**\2,  T1 '  ri 


for  j«2,...,k,  where  Rj  Is  based  on  Y  -  Z  y. 

M  M 

Let  N”1/2S2  be  the  (k-l)xl  vector  of  such  components, 


.1/2,  a 


Now,  when  N  (Y~Y)  is  bounded  In  probability  under  Hq 


for  large  N,  the  result  In  the  proof  of  Theorem  5.3.2, 
Hettmansperger  (1984),  shows  that 


J S2*  -*  Z  ~  MVN(0,A*) 

n1/2  ~ 

where  A*  =  11m  N'Hw^'Wc*  -  Wc*,Zc(Zc,Zcr1Zc'wc*]. 
Now  define 

S2,«(S1.S2*'). 

0m  0m 


Since,  Si  »  -  E  S<,  we  have 
j-2 


1-1/2 


-1 


N-1/2s2* 


Z  -  MVN(O.A) 


where  A  -  lira  N-TfWc'Wc  -  Wc'ZcUc'Zc^Zc'Wc]. 
Now, 


k 

E 

j-1 


\j<cj-Cw)*j 


k 

E 

j-1 


^(Cj-tw) 


%)( «j 


-1/2  k 
-  N  E 

j-1 


(Cj-cw)  Sj 


which  converges  In  distribution  to  a  normal  random  variable 
with  mean  0  and  variance  c*Ac  where  c'  ■  (c-|-cw, ...^ic-Cy). 


Finally,  note  that  N_lwc'wc  has  1th  diagonal  element 
\i(l-\i)  and  off  diagonal  elements  -\i\j.  Straightforward 
calculation  shows  E\j(cj-cw)2  ■  c'(N"lwc'wc)c, 


and  the  theorem  Is  proved. 
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